Model Selection

Local deployment optimization

# Local deployment optimization

Bielik 4.5B V3.0 Instruct GGUF

Bielik-4.5B-v3.0-Instruct-GGUF is a Polish large language model released by SpeakLeash, converted from Bielik-4.5B-v3.0-Instruct to GGUF quantized format, suitable for local inference.

Large Language Model Other

Apriel 5B Instruct Llamafied

This is an approximate implementation version of the ServiceNow-AI's Apriel-5B-Instruct model converted to Llama format, compatible with mainstream fine-tuning frameworks for easier operation.

Large Language Model

Huihui Ai Gemma 3 1b It Abliterated GGUF

This is a quantized version of Google Gemma 3B model, optimized based on llama.cpp, suitable for running in resource-limited environments.

Large Language Model

Deepseek R1 GGUF

DeepSeek-R1 is a 1.58-bit dynamically quantized large language model optimized by Unsloth, adopting the MoE architecture and supporting English task processing.

Large Language Model English

E5 Base V2 Gguf

GGUF format file of the e5-base-v2 embedding model, used for tasks such as sentence similarity calculation, supporting a maximum context of 512 tokens.

Text Embedding English

Polka 1.1b Chat

The first Polish dialogue assistant model specifically designed for local deployment, based on TinyLlama-1.1B with extended Polish tokenizer and trained with DPO optimization

Large Language Model

Transformers Other

GPT NeoX 1.3B Viet Final GGUF

1.3B parameter GPT-NeoX model pretrained on 31.3GB Vietnamese data

Large Language Model English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase